Modeling context-dependent phonetic units in a continuous speech recognition system for Mandarin Chinese
نویسندگان
چکیده
We study the problem of phonetic modeling for continuous Mandarin speech recognition by providing a systematic performance comparison for systems based on following primitive speech units: syllable, demi-syllable (Initials and Finals), context-independent phones, left-or-right context-dependentphones (diphones), and leftand-right context-dependent phones (triphones). In our speakerdependent continuous speech recognition experiments, a generalized triphone system has achieved the best performance among all. Our best system contrasts most other Mandarin speech recognition systems which have been based on demi-syllable units.
منابع مشابه
Improved Bayesian Training for Context-Dependent Modeling in Continuous Persian Speech Recognition
Context-dependent modeling is a widely used technique for better phone modeling in continuous speech recognition. While different types of context-dependent models have been used, triphones have been known as the most effective ones. In this paper, a Maximum a Posteriori (MAP) estimation approach has been used to estimate the parameters of the untied triphone model set used in data-driven clust...
متن کاملUsing Chi-Square Testing in Modeling Confusion Characteristics for Robust Phonetic Set Generation
A phonetic representation of a language is used to describe the corresponding pronunciation and synthesize the acoustic model of any vocabulary. In order to obtain better phonetic representation, context-dependent units are used to model co-articulation effects between phones and have been broadly in speech recognition. However, this representation generally increases the number of recognition ...
متن کاملPhonetic Modelling in the Philips Chinese Continuous Speech Recognition System
We have extended the Philips large vocabulary continuous speech recognition system towards Chinese On the way from our existing Western language technology to Mandarin the rst step was to build a suitable phonetic model This paper describes the development of our phonetic model excluding tones for Mandarin Chinese We will present a systematic comparison of three forms of sub syllabic units for ...
متن کاملGeneration of robust phonetic set and decision tree for Mandarin using chi-square testing
A phonetic representation of a language is used to describe the corresponding pronunciation and synthesize the acoustic model of any vocabulary. A phonetic representation with smaller phonetic units such as SAMPA-C for Mandarin Chinese and decision trees for parameter sharing are broadly applied to deal with the problem of large numbers of recognition units. However, the confusable phonetic rep...
متن کاملTwo-stream modeling of Mandarin tones
Tone modeling is a critical component for Mandarin largevocabulary continuous-speech recognition systems. In previous work on pitch-feature extraction, we reported character error rate reductions of over 30% over the non-tonal baseline [1]. In this paper, we investigate how best to integrate tone modeling with a Mandarin LVCSR system. The paper focusses on the two-stream method, which is based ...
متن کامل